Assessing the Effects of Symmetry on Motif Discovery and Modeling

نویسندگان

  • Lala M. Motlhabi
  • Gary D. Stormo
چکیده

BACKGROUND Identifying the DNA binding sites for transcription factors is a key task in modeling the gene regulatory network of a cell. Predicting DNA binding sites computationally suffers from high false positives and false negatives due to various contributing factors, including the inaccurate models for transcription factor specificity. One source of inaccuracy in the specificity models is the assumption of asymmetry for symmetric models. METHODOLOGY/PRINCIPAL FINDINGS Using simulation studies, so that the correct binding site model is known and various parameters of the process can be systematically controlled, we test different motif finding algorithms on both symmetric and asymmetric binding site data. We show that if the true binding site is asymmetric the results are unambiguous and the asymmetric model is clearly superior to the symmetric model. But if the true binding specificity is symmetric commonly used methods can infer, incorrectly, that the motif is asymmetric. The resulting inaccurate motifs lead to lower sensitivity and specificity than would the correct, symmetric models. We also show how the correct model can be obtained by the use of appropriate measures of statistical significance. CONCLUSIONS/SIGNIFICANCE This study demonstrates that the most commonly used motif-finding approaches usually model symmetric motifs incorrectly, which leads to higher than necessary false prediction errors. It also demonstrates how alternative motif-finding methods can correct the problem, providing more accurate motif models and reducing the errors. Furthermore, it provides criteria for determining whether a symmetric or asymmetric model is the most appropriate for any experimental dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of an Efficient Hybrid Method for Motif Discovery in DNA Sequences

This work presents a hybrid method for motif discovery in DNA sequences. The proposed method called SPSO-Lk, borrows the concept of Chebyshev polynomials and uses the stochastic local search to improve the performance of the basic PSO algorithm as a motif finder. The Chebyshev polynomial concept encourages us to use a linear combination of previously discovered velocities beyond that proposed b...

متن کامل

Analysis of the Effects of Hyperthermia with Magnetic Nanoparticles on Cancer Tissues

Introduction: Hyperthermia is one of the noninvasive methods of treating cancer. In this method, heat can be generated in several methods. One of these methods is injecting magnetic nanoparticles as a solution into the tumor site and place it in a magnetic field. Methods: The study was analytical one, modeling was performed using computational methods, and in vitro experimental data were used ...

متن کامل

A Discussion on Concept of Symmetry snd Asymmetry

This paper is concerned about the concept os asymmetry. The different types of asymmetry for univariate and multivariate distributions have introduces been considered as well as some of usual asymmetry criteria. A brief overview of method for adding the capability of modeling asymmetry to a symmetry distribution is also a secondary purpose of this paper.

متن کامل

Effects of asymmetric stiffness on parametric instabilities of rotor

This work deals with effects of asymmetric stiffness on the dynamic behaviour of the rotor system. The analysis is presented through an extended Lagrangian Hamiltonian mechanics on the asymmetric rotor system, where symmetries are broken in terms of the rotor stiffness. The complete dynamics of asymmetries of rotor system is investigated with a case study. In this work, a mathematical model is ...

متن کامل

The use of optimization algorithm for assessing effects of Carboxyl Functionalized MWCNTs on the productivity of nidltrusion process

Among the several available techniques to produce the braided composite rods for construction industry, nidltrusion process is becoming the most widespread and cost-effective continuous processing technique. The work mentions the influence of carboxyl functionalized multiwalled carbon nanotubes (MWCNTs) on the maximum speed of manufacturing process. The epoxy polymer is diglycidyl ether of bisp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 6  شماره 

صفحات  -

تاریخ انتشار 2011